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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/724 , 586 



DATE: 09/04/2002 
TIME: 11:57:34 




ENTERED 



Input Set : N:\Crf3\RULE60\09724586.raw 
Output Set: N:\CRF4\09042002\I724586.raw 

APPLICANT: Sakowicz, Roman 
Goldstein, Lawrence S. B. 

The Regents of the University of California 

TITLE OF INVENTION: Identification and Expression of a Novel Kinesin Motor 
Protein 

FILE REFERENCE: 18557C-000710US 
CURRENT APPLICATION NUMBER: 09/724,586 
CURRENT FILING DATE: 2000-11-28 
PRIOR APPLICATION NUMBER: US/09/235, 416 
PRIOR FILING DATE: 1999-01-22 

PRIOR APPLICATION NUMBER: WO PCT/US99/01355 
PRIOR FILING DATE: 1999-01-22 
PRIOR APPLICATION NUMBER: US 60/072,361 
PRIOR FILING DATE: 1998-01-23 
NUMBER OF SEQ ID NOS : 7 
SOFTWARE: Patentln Ver . 2.0 
SEQ ID NO: 1 
LENGTH: 784 
TYPE : PRT 

ORGANISM: Thermomyces lanuginosus 
FEATURE : 

OTHER INFORMATION: TL-gamma ATP-dependent plus end-directed 
microtubule motor protein 
FEATURE : 

NAME/KEY: DOMAIN 
LOCATION: (1) . . (357) 

OTHER INFORMATION: kinesin-like microtubule motor domain 
FEATURE : 

NAME/KEY: DOMAIN 
LOCATION: (358) . . (442) 

OTHER INFORMATION: neck domain links motor domain to stalk domain 
FEATURE : 

NAME/KEY: DOMAIN 
LOCATION: (443) . . (601) 

OTHER INFORMATION: stalk domain, unc-104 family domain 
FEATURE : 

NAME/KEY: DOMAIN 
LOCATION : ( 602 ) . . ( 784 ) 
OTHER INFORMATION: tail domain 
SEQUENCE : 1 

Met Ser Gly Gly Gly Asn lie Lys Val Val Val Arg Val Arg 

15 10 
Asn Ala Arg Glu lie Asp Arg Gly Ala Lys Cys lie Val Arg 



Pro 
15 
Met 



Phe 



Glu 
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48 20 25 30 

49 Gly Asn Gin Thr lie Leu Thr Pro Pro Pro Gly Ala Glu Glu Lys Ala 

50 35 40 45 

51 Arg Lys Ser Gly Lys Thr lie Met Asp Gly Pro Lys Ala Phe Ala Phe 

52 50 55 60 

53 Asp Arg Ser Tyr Trp Ser Phe Asp Lys Asn Ala Pro Asn Tyr Ala Arg 

54 65 70 75 80 

55 Gin Glu Asp Leu Phe Gin Asp Leu Gly Val Pro Leu Leu Asp Asn Ala 

56 85 90 95 

57 Phe Lys Gly Tyr Asn Asn Cys lie Phe Ala Tyr Gly Gin Thr Gly Ser 

58 100 105 110 

59 Gly Lys Ser Tyr Ser Met Met Gly Tyr Gly Lys Glu His Gly Val lie 

60 115 120 125 

61 Pro Arg lie Cys Gin Asp Met Phe Arg Arg lie Asn Glu Leu Gin Lys 

62 130 135 140 

6 3 Asp Lys Asn Leu Thr Cys Thr Val Glu Val Ser Tyr Leu Glu lie Tyr 

64 145 150 155 160 

6 5 Asn Glu Arg Val Arg Asp Leu Leu Asn Pro Ser Thr Lys Gly Asn Leu 

66 165 170 175 

6 7 Lys Val Arg Glu His Pro Ser Thr Gly Pro Tyr Val Glu Asp Leu Ala 

68 180 185 190 

69 Lys Leu Val Val Arg Ser Phe Gin Glu lie Glu Asn Leu Met Asp Glu 

70 195 200 205 

71 Gly Asn Lys Ala Arg Thr Val Ala Ala Thr Asn Met Asn Glu Thr Ser 

72 210 215 220 

7 3 Ser Arg Ser His Ala Val Phe Thr Leu Thr Leu Thr Gin Lys Trp His 

74 225 230 235 240 

75 Asp Glu Glu Thr Lys Met Asp Thr Glu Lys Val Ala Lys lie Ser Leu 

76 245 250 255 

77 Val Asp Leu Ala Gly Ser Glu Arg Ala Thr Ser Thr Gly Ala Thr Gly 

78 260 265 270 

79 Ala Arg Leu Lys Glu Gly Ala Glu lie Asn Arg Ser Leu Ser Thr Leu 

80 275 280 285 

81 Gly Arg Val lie Ala Ala Leu Ala Asp Met Ser Ser Gly Lys Gin Lys 

82 290 295 300 

83 Lys Asn Gin Leu Val Pro Tyr Arg Asp Ser Val Leu Thr Trp Leu Leu 

84 305 310 315 320 

85 Lys Asp Ser Leu Gly Gly Asn Ser Met Thr Ala Met lie Ala Ala lie 

86 325 330 335 

87 Ser Pro Ala Asp lie Asn Phe Glu Glu Thr Leu Ser Thr Leu Arg Tyr 

88 340 345 350 

89 Ala Asp Ser Ala Lys Arg lie Lys Asn His Ala Val Val Asn Glu Asp 

90 355 360 365 

91 Pro Asn Ala Arg Met lie Arg Glu Leu Lys Glu Glu Leu Ala Gin Leu 

92 '370 375 380 

93 Arg Ser Lys Leu Gin Ser Ser Gly Gly Gly Gly Gly Gly Ala Gly Gly 

94 385 390 395 400 

95 Ser Gly Gly Pro Val Glu Glu Ser Tyr Pro Pro Asp Thr Pro Leu Glu 

96 405 410 415 
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97 Lys Gin lie Val Ser lie Gin Gin Pro Asp Ala Thr Val Lys Lys Met 

98 420 425 430 

99 Ser Lys Ala Glu lie Val Glu Gin Leu Asn Gin Ser Glu Lys Leu Tyr 

100 435 440 445 

101 Arg Asp Leu Asn Gin Thr Trp Glu Glu Lys Leu Ala Lys Thr Glu Glu 

102 450 455 460 

103 He His Lys Glu Arg Glu Ala Ala Leu Glu Glu Leu Gly He Ser He 

104 465 470 475 480 

105 Glu Lys Gly Phe Val Gly Pro Tyr His Ser Lys Glu Met Pro His Leu 

106 485 490 495 

107 Val Asn Leu Ser Asp Asp Pro Leu Leu Ala Glu Cys Leu Val Tyr Asn 

108 500 505 510 

109 He Lys Pro Gly Gin Thr Arg Val Gly Asn Val Asn Gin Asp Thr Gin 

110 515 520 525 

111 Ala Glu He Arg Leu Asn Gly Ser Lys He Leu Lys Glu His Cys Thr 

112 530 535 540 

113 Phe Glu Asn Val Asp Asn Val Val Thr He Val Pro Asn Glu Lys Ala 

114 545 550 555 560 

115 Ala Val Met Val Asn Gly Val Arg He Asp Lys Pro Thr Arg Leu Arg 

116 565 570 575 

117 Ser Gly Tyr Arg He He Leu Gly Asp Phe His He Phe Arg Phe Asn 

118 580 585 590 

119 His Pro Glu Glu Ala Arg Ala Glu Arg Gin Glu Gin Ser Leu Leu Arg 

120 595 600 605 

121 His Ser Val Thr Asn Ser Gin Leu Gly Ser Pro Ala Pro Gly Arg His 

122 610 615 620 

123 Asp Arg Thr Leu Ser Lys Ala Gly Ser Asp Ala Asp Gly Asp Ser Arg 

124 625 630 635 640 
12 5 Ser Asp Ser Pro Leu Pro His Phe Arg Gly Lys Asp Ser Asp Trp Phe 

126 645 650 655 

127 Tyr Ala Arg Arg Glu Ala Ala Ser Ala He Leu Gly Leu Asp Gin Lys 

128 660 665 670 

129 He Ser His Leu Thr Asp Asp Glu Leu Asp Ala Leu Phe Asp Asp Val 

130 675 680 685 

131 Gin Lys Ala Arg Ala Val Arg Arg Gly Leu Val Glu Asp Asn Glu Asp 

132 690 695 700 

133 Ser Asp Ser Gin Ser Ser Phe Pro Val Arg Asp Lys Tyr Met Ser Asn 

134 705 710 715 720 

135 Gly Thr He Asp Asn Phe Ser Leu Asp Thr Ala He Thr Met Pro Gly 

136 725 730 735 

137 Thr Pro Arg Ser Asp Asp Asp Gly Asp Ala Leu Phe Phe Gly Asp Lys 

138 740 745 750 

139 Lys Ser Lys Gin Asp Ala Ser Asn Val Asp Val Glu Glu Leu Arg Gin 

140 755 760 765 

141 Gin Gin Ala Gin Met Glu Glu Ala Leu Lys Thr Ala Lys Gin Glu Phe 

142 770 775 780 



144 <210> SEQ ID NO: 2 

145 <211> LENGTH: 2352 

146 <212> TYPE: DNA 
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147 <213> ORGANISM: Thermomyces lanuginosus 

14 8 <22 0> FEATURE: 

149 <223> OTHER INFORMATION: TL-gamma ATP-dependent plus end-directed 

150 microtubule motor protein 

151 <400> SEQUENCE: 2 

152 atgtcgggcg gtggaaatat caaggtggtg gtgcgggtac gcccgttcaa cgcccgagaa 6 0 

153 atcgaccgtg gcgcaaaatg tattgtgcgg atggaaggaa atcaaaccat cctcacccct 120 

154 cctccgggtg ccgaagagaa ggcgcgtaaa agtggcaaaa ctattatgga tggcccgaag 180 

155 gcatttgcgt tcgatcggtc gtattggtcc tttgacaaga atgctcccaa ctatgcgaga 240 

156 caggaagacc tattccaaga tctcggagtc ccgcttctgg ataatgcatt caagggttat 300 

157 aacaattgta tcttcgccta cggtcagacc ggttcgggca agtcctattc aatgatgggc 360 

158 tatggcaagg agcatggcgt gatcccgcgg atttgccagg acatgttccg gcgtattaat 420 

159 gaactgcaga aggacaagaa cctcacttgc accgtcgaag tttcgtactt ggaaatttac 4 80 

160 aatgaacgag tgcgagactt gctgaatccg tcgacaaagg ggaatctcaa ggtccgagaa 540 

161 cacccgtcga ccggccccta cgtggaggac ttggcgaagc tggtcgtgcg atcattccaa 6 00 

162 gaaatcgaaa atctcatgga tgagggcaac aaagccagaa cggttgccgc cacaaacatg 6 60 

163 aacgagacat ccagtcgatc ccacgccgtc ttcactttga ccttgacgca aaagtggcat 720 

164 gatgaagaga ccaaaatgga cacagagaag gttgcgaaga tcagtctggt agatttggcg 780 

165 ggttctgagc gagcaacgtc caccggagct actggagcgc gactgaagga gggtgcagag 840 

166 atcaaccgct cactttcgac cctaggtcgt gtgattgcag cgctagcgga tatgtcgtcg 900 

167 ggaaaacaga agaagaatca gttagtacct taccgagatt cggtactgac gtggcttctg 960 

168 aaggactcct tgggaggcaa ctcgatgacc gccatgattg ccgccatttc gcctgctgat 1020 

169 attaactttg aagagactct cagtaccctt cgatatgcgg actctgcgaa gcgaatcaag 1080 

170 aaccacgcag tggtcaatga agacccgaac gcgcggatga tccgcgagtt gaaggaggaa 1140 

171 ctcgcgcagc tgaggagcaa actccagagc agtggtggag gtggaggtgg tgcaggaggt 1200 

172 tctggcgggc cagtggagga atcgtacccg cccgacacgc cgctcgagaa gcaaatcgtg 1260 

173 tcgattcagc agccggatgc gacagtcaag aaaatgagca aggcagaaat cgtggagcaa 1320 

174 ctgaaccaga gtgagaagct ctatcgggat ctcaatcaga cctgggaaga gaagctggcc 1380 

175 aagaccgagg aaatccacaa ggaacgagaa gcggcgctcg aggagctggg tatcagcatc 1440 

176 gaaaagggct ttgttggccc ttaccactcc aaagaaatgc cacatctagt caacttgagc 1500 

177 gatgatcctc ttctggctga gtgtcttgtc tacaacatca agcccgggca gacaagggtt 1560 

178 ggaaacgtca accaagatac acaagcggaa attcgtctga acggttcgaa gatcctgaaa 1620 

179 gaacactgta cgtttgaaaa tgtggacaac gttgtgacca tcgtgccaaa cgagaaggct 1680 

180 gctgtcatgg tgaacggcgt gcgaatcgac aagcctactc gcctccgcag cggctacagg 1740 

181 atcatcctgg gcgatttcca catttttcga ttcaaccatc cggaagaagc tcgtgcggaa 1800 

182 cggcaagaac aatccttgct tcgccattct gtcaccaaca gtcagttggg ttcgcctgct 1860 

183 ccaggccgtc acgaccggac actgagcaag gcgggttcgg atgcggacgg cgattctcgc 1920 

184 tcagattctc ctttgccgca ctttcgtgga aaggatagcg actggttcta tgctcgcagg 1980 

185 gaagctgcta gcgcgatcct agggttggat cagaagatct ctcatctgac agatgacgag 2040 

186 ttggatgcat tatttgacga tgttcagaaa gcgcgggcag ttcgtcgtgg gctggtcgaa 2100 

187 gacaacgaag atagcgattc gcagagttcg tttccggtcc gtgacaaata catgtccaat 2160 

188 ggaaccattg ataatttctc gctcgatacc gccattacta tgccgggtac ccctcgtagt 2220 

189 gatgacgacg gtgacgcgct gttttttggt gataagaagt cgaaacagga tgcgtctaat 2280 

190 gttgatgttg aggagttgcg tcaacagcag gctcagatgg aagaagccct gaaaacagcg 2 340 

191 aagcaggaat tc 2352 

193 <210> SEQ ID NO: 3 

194 <211> LENGTH: 21 

195 <212> TYPE: DNA 

196 <213> ORGANISM: Artificial Sequence 
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197 <220> FEATURE: 

198 <223> OTHER INFORMATION: Description of Artificial Sequence : primer 

199 <400> SEQUENCE: 3 

200 atgtcgggcg gtggaaatat c .21 

202 <210> SEQ ID NO: 4 

203 <211> LENGTH: 23 

204 <212> TYPE: DNA 

205 <213> ORGANISM: Artificial Sequence 

206 <220> FEATURE: 

207 <223> OTHER INFORMATION: Description of Artificial Sequence : primer 

208 <400> SEQUENCE: 4 

209 gaattcctgc ttcgctgttt tea 23 

211 <210> SEQ ID NO: 5 

212 <211> LENGTH: 30 

213 <212> TYPE: DNA 

214 <213> ORGANISM: Artificial Sequence 

215 <220> FEATURE: 

216 <223> OTHER INFORMATION: Description of Artificial Sequence : degenerate 

217 forward primer 

218 <220> FEATURE: 

219 <221> NAME/KEY: modif ied_base 

220 <222> LOCATION: (25) 

221 <223> OTHER INFORMATION: n = a, c, g or t 

222 <400> SEQUENCE: 5 

W--> 223 gcgcggatcc atyttygcht ayggncarac 30 

225 <210> SEQ ID NO : 6 

226 <211> LENGTH: 30 

227 <212> TYPE: DNA 

228 <213> ORGANISM: Artificial Sequence 

229 <220> FEATURE: 

230 <223> OTHER INFORMATION: Description of Artificial Sequence : degenerate 
2 31 reverse primer 

2 32 <22 0> FEATURE: 

233 <221> NAME/KEY: modif ied_base 

234 <222> LOCATION: (16) 

235 <223> OTHER INFORMATION: n = a, c, g or t 

236 <220> FEATURE: 

2 37 <221> NAME/KEY: modif ied_base 

238 <222> LOCATION: (28) 

239 <223> OTHER INFORMATION: n = a, c, g or t 
24 0 <4 00> SEQUENCE: 6 

W--> 241 gegegaatte tedgancedg cvarrtcnac 30 

243 <210> SEQ ID NO : 7 

244 <211> LENGTH: 30 

245 <212> TYPE: DNA 

246 <213> ORGANISM: Artificial Sequence 

247 <220> FEATURE: 

248 <223> OTHER INFORMATION: Description of Artificial Sequence : degenerate 
24 9 reverse primer 
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RAW SEQUENCE LISTING ERROR SUMMARY 
PATENT APPLICATION: US/09/724,586 



DATE: 09/04/2002 
TIME: 11:57:36 



Input Set : N:\Crf3\RULE60\09724586.raw 
Output Set: N:\CRF4\09042002\I724586.raw 



Please Note; 

Use of n and/or Xaa have been detected in the Sequence Listing, Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:5; N Pos . 25 
Seq#:6; N Pos. 16,28 
Seq#:7; N Pos. 16,28 

Invalid Line Length; 

The rules require that a line not exceed 72 characters in length. This includes spaces. 

Seq#:l; Line(s) 4 
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VERIFICATION SUMMARY DATE: 09/04/2002 

PATENT APPLICATION: US/09/724 f 586 TIME: 11:57:36 



Input Set : 
Output Set: 

L:223 M:341 W: (46) "n" or "Xaa" 
L:241 M:341 W: (46) "n" or "Xaa" 
L:259 M:341 W: (46) "n" or "Xaa" 



N : \Cr f 3\RULE6 0\ 09724586. raw 
N:\CRF4\09042002\I724586.raw 

used, for SEQ ID# : 5 after pos . : 0 
used, for SEQ ID# : 6 after pos . : 0 
used, for SEQ ID# : 7 after pos . : 0 
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